Search CORE

5 research outputs found

A Novel Two-Stage Spectrum-Based Approach for Dimensionality Reduction: A Case Study on the Recognition of Handwritten Numerals

Author: Mohammad Amin Shayegan
Ram Gopal Raj
Saeed Aghabozorgi
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2014
Field of study

Dimensionality reduction (feature selection) is an important step in pattern recognition systems. Although there are different conventional approaches for feature selection, such as Principal Component Analysis, Random Projection, and Linear Discriminant Analysis, selecting optimal, effective, and robust features is usually a difficult task. In this paper, a new two-stage approach for dimensionality reduction is proposed. This method is based on one-dimensional and two-dimensional spectrum diagrams of standard deviation and minimum to maximum distributions for initial feature vector elements. The proposed algorithm is validated in an OCR application, by using two big standard benchmark handwritten OCR datasets, MNIST and Hoda. In the beginning, a 133-element feature vector was selected from the most used features, proposed in the literature. Finally, the size of initial feature vector was reduced from 100% to 59.40% (79 elements) for the MNIST dataset, and to 43.61% (58 elements) for the Hoda dataset, in order. Meanwhile, the accuracies of OCR systems are enhanced 2.95% for the MNIST dataset, and 4.71% for the Hoda dataset. The achieved results show an improvement in the precision of the system in comparison to the rival approaches, Principal Component Analysis and Random Projection. The proposed technique can also be useful for generating decision rules in a pattern recognition system using rule-based classifiers

Crossref

Directory of Open Access Journals

A New Dataset Size Reduction Approach for PCA-Based Classification in OCR Application

Author: Mohammad Amin Shayegan
Saeed Aghabozorgi
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2014
Field of study

A major problem of pattern recognition systems is due to the large volume of training datasets including duplicate and similar training samples. In order to overcome this problem, some dataset size reduction and also dimensionality reduction techniques have been introduced. The algorithms presently used for dataset size reduction usually remove samples near to the centers of classes or support vector samples between different classes. However, the samples near to a class center include valuable information about the class characteristics and the support vector is important for evaluating system efficiency. This paper reports on the use of Modified Frequency Diagram technique for dataset size reduction. In this new proposed technique, a training dataset is rearranged and then sieved. The sieved training dataset along with automatic feature extraction/selection operation using Principal Component Analysis is used in an OCR application. The experimental results obtained when using the proposed system on one of the biggest handwritten Farsi/Arabic numeral standard OCR datasets, Hoda, show about 97% accuracy in the recognition rate. The recognition speed increased by 2.28 times, while the accuracy decreased only by 0.7%, when a sieved version of the dataset, which is only as half as the size of the initial training dataset, was used

Crossref

Directory of Open Access Journals

A new method for Arabic/Farsi numeral data set size reduction via modified frequency diagram matching

Author: Mohammad Amin Shayegan
Saeed Aghabozorgi
Publication venue: 'Emerald'
Publication date
Field of study

Crossref

Fast content access and retrieval of JPEG compressed images

Author: Ghanbari Mohammad
Mehrabi Mahdi
Shayegan Mohammad Amin
Zargari Farzad
Publication venue: Elsevier BV
Publication date: 01/08/2016
Field of study

Fast content access and content based retrieval of images are among the common web and signal processing applications; on the other hand, JPEG is the dominant format for image compression in a wide range of applications. In this paper, a simple and fast method for content access and retrieval of JPEG coded images is presented which does not require complete decompression of coded images. The presented method uses DCT coefficients of coded blocks in the JPEG bit stream to extract average values of various size picture blocks namely DC values. The DC values provide an approximation of the coded image, which can be employed to construct a lower resolution picture or color histogram of the JPEG coded image for retrieval and other applications without full decompression of the image

University of Essex Research Repository

Intensification of oxidative desulfurization of gas oil by ultrasound irradiation: Optimization using Box–Behnken design (BBD)

Author: Akbari
Akbari
Bezerra
Bhasarkar
Bhasarkar
Campos-Martin
Chakma
Chen
Chen
Chica
Choi
Dai
Dai
De Filippis
Dehkordi
Dehkordi
Dehkordi
Duarte
Fan
Hernández-Maldonado
Ismagilov
Ito
Koblov
Levy
Ma
Margeta
Margeta
Margeta
Mei
Mjalli
Mohammad Amin Sobati
Mohammad Reza Jalali
Nehlsen
Rubio
Seeberger
Shayegan
Sobati
Sobati
Soleimani
Stanislaus
Wan
Wan
Wang
Wang
Wang
Wu
Zhao
Žnidarčič
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref